MiCROCAUSALITY IN CURVED SpACE-TiME 



Sergei Dubovsky'^'^, Alberto Nicolis^'*^, 
Enrico Trincherini^, and Giovanni Villadoro^ 

^ Jefferson Physical Laboratory, 
Harvard University, Cambridge, MA 02138, USA 

^ Institute for Nuclear Research of the Russian Academy of Sciences, 
60th October Anniversary Prospect, la, 117312 Moscow, Russia 

^ Department of Physics, Columbia University, 
New York, NY 10027, USA 



Abstract 

It is well known that in Lorentz invariant quantum field theories in flat space the com- 
mutator of space-like separated local operators vanishes (microcausality). We provide two 
different arguments showing that this is a consequence of the causal structure of the classical 
theory, rather than of Lorentz invariance. In particular, microcausality holds in arbitrary 
curved space-times, where Lorentz invariance is explicitly broken by the background metric. 
As illustrated by an explicit calculation on a cylinder this property is rather non trivial at 
the level of Feynman diagrams. 



1 Prelude: Microcausality in Flat Space 



One crucial property of any consistent relativistic quantum field theory in Minkowski space is 'mi- 
crocausality'. Microcausality is the statement that the commutator of local (Heisenberg-picture) 
operators evaluated at space-like separated points vanishes, 

[0^{x),OM] =0 for (x-y)2<0, (1) 

and it is an exact, non-perturbative statement about the quantum theory. Its relation to causality 
is obvious: according to quantum mechanics, if two operators commute measuring one cannot 
have any influence on measuring the other. 

Microcausality holds as an operator equation. However it is also instructive to consider its 
vacuum expectation value (vev), since the vev of the commutator of two local operators describes 
the causal response of the vacuum to external sources. Indeed, it is the generalization of the 
retarded Green's function to an interacting theory. Consider for instance a real scalar with 
generic self interactions, coupled to a local external source J(x), H = -\- J d^x J(j). We can ask 
what is the vev of at some point y = {y'^,y) if we start with the vacuum at t = — oo and we 
then turn on a localized source J for a finite period of time. In the interaction picture, treating 
J0 as the interaction, this is 

{(f)(y))j = (o|f e'^i^*'^'"^'^ 0(?/) Te-'^--'^*'^'"-^'^|0) , (2) 

where by T and T we denote time-ordered and anti-time-ordered products, respectively. The above 
formula can be easily derived through Schwinger's 'in-in' formalism (see ref. [1] for a nice review). 
Expanding in the external source J we see that at leading order the vev of the 00 commutator 
plays the role of a Green's function for 0, 

= / d'x J{x) ■ I (0| [0(x), 0(1/)] |0) + . . . (3) 

J — oo 

If the 00 commutator vanishes outside the light cone, then the source can only generate a non-zero 
field inside its future light cone. In the following by 'microcausality' we will refer both to the more 
general operator statement and to its vacuum expectation value. 

Microcausality is crucial in making the 5- matrix Lorentz invariant. The S'-matrix is con- 
structed in terms of T-ordered products of local field operators. The T-ordering of fields evaluated 
at space- like separated points x and y is in principle not Lorentz invariant, since by a Lorentz 
boost one can invert the time-ordering of x and y. A sufficient condition for recovering Lorentz 
invariance is that field operators at space-like distance commute. 

For a Lorentz-invariant theory microcausality can be proven in a variety of ways (see, e.g., 
ref. [2]). One proof makes use of the Kallen-Lehmann representation. The two point-function can 
be written as a sum over intermediate states, 

(0| 0(x)0(i/) |0) = / Vp(/i') A+(x - y- 1?) , (4) 
Jo 
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where /^^ is the intermediate states' invariant mass, p{n'^) is the spectral density function, and 
A+(x — y; //^) is the free scalar two-point function for a particle of mass /i: 

A^x - y; ^i') = / d'p ^(/) e^^<^-y^5(p' + ^i') . (5) 

Then the causal properties of the full quantum two-point function are the same as in a free theory. 
In particular 

/•CO 

(0| [cf>{x), ci>{y)] |0) = / (0| [cf>{x), ci>{y)] |0)free,mass=M , (6) 

which vanishes for space-like separated x and y. 

Also microcausality follows directly from canonical quantization of the theory. The system 
is quantized by imposing canonical commutation relations at t = and then evolving forward 
and backward in time through the unitary time-evolution operator defined by the Hamiltonian. 
Canonical commutation relations are preserved by the time-evolution. If Lorentz-boosts are a 
symmetry of the theory, they are represented in the Hilbert space of the theory through unitary 
operators. These too preserve the canonical commutation relations. This means that field opera- 
tors at space-like distance will commute, since by a Lorentz-boost space-like separated points can 
be brought to lie on an equal time surface, where the corresponding field operators commute. 

A less familiar argument which will be useful in the following exploits the Euclidean continua- 
tion of the theory. The vacuum expectation value of the (fxp commutator is given by the imaginary 
part of the Feynman propagator, 

(0| [(/.(x), (/.(y)] |0) = i sign(x° - y') ■ Im (0| T{(/.(x)0(?/)} |0) . (7) 

(More generally, [0,0] as an operator is given by the anti-Hermitian part of T{</)0}.) In the 
Euclidean theory the Feynman propagator between two different points is real, being a path- 
integral of manifestly real quantities, 

(O|r{</)(O)0(xs)} |0) = J P(/-e-^-[^l0(O)0(xB) . (8) 

Of course we arc assuming that the above path-integral can be made sense of by the removal of 
all UV divergences through the usual renormalization procedure. In this case the only residual 
divergence we can have is in the limit xe ^ 0. Now, the Euclidean theory is obtained from the 
Minkowskian one via the identification t = itj^, with real tj^. If the theory is Lorentz invariant 
the propagator can only be a function of the invariant combination t\ + = —t^ + x^. Therefore 
the invariant statement is not simply that the Feynman propagator is real for real — that is for 
t% > 0. Rather, it is that the Feynman propagator is real for positive — -|- a;^, i.e. outside the 
light cone. In other words, when we talk about the Euclidean continuation of a Lorentz invariant 
tlieory we are automatically talking about the region outside the light cone in Minkowski space. 
The Euclidean origin corresponds to the whole light cone. The analytic continuation back to 
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Minkowski space can be seen as a continuation into the light-cone — leading to a non-zero imaginary 
part of the propagator, as a result of the ie prescription for circumventing the singularity localized 
on the light-cone (= Euclidean origin). As long as we stay outside the light-cone the Feynman 
propagator is manifestly real, and thus eq. ([7]) vanishes. 

The point we want to stress here is that all these simple arguments assume, and crucially rely 
on the exact Lorentz invariance of the quantum theory. This is explicit in all the derivations we 
sketched above, with the exception of that based on the Kallen-Lehman representation. There 
too however Lorentz invariance is necessary in order to arrive to eq. (jll) in the first place, that 
is, to write the full propagator as a sum of free propagators. Also, without making explicit use 
of Lorentz invariance it is extremely hard to keep track of microcausality in the perturbative 
expansion — for instance by checking whether loop corrections to the Feynman propagator in real 
space give a non-zero imaginary part outside the light cone (see, e.g., the example in sect. S]). 

What happens then when we consider a local relativistic QFT in a fixed curved space-time? 
Such a theory is not Lorentz invariant. The fact that the classical Lagrangian is locally Lorentz 
invariant tells us that Lorentz invariance is going to be an approximate symmetry of the theory 
at high momenta/short distances. In particular the UV properties of the theory, like e.g. renor- 
malizability are not affected by the presence of a curved background (see e.g. p5]). But certainly 
Lorentz invariance will not be an exact global symmetry of the full theory. Even for metrically flat 
but topologically non-trivial spaces (i.e., tori) there are calculable IR effects that break Lorentz 
invariance, the simplest example being the Casimir effect. It is therefore not manifest whether 
microcausality is going to be a generic feature of QFTs in curved space-times, or whether instead 
it may be impaired by Lorentz-violating IR effects. Definitely all arguments mentioned above 
are not directly applicable to this case. Indeed the possibility that microcausality be generically 
violated in curved space-time has been the object of recent investigations [3 [S], leading to the 
striking claim that QED itself at one-loop level features microcausality violations. 

The purpose of the present paper is to show that microcausality does hold in curved space-time. 
We start in sect.[2]by reviewing the connection between causality and analytic dispersion relations, 
such as the Kramers-Kronig relations between the real and imaginary parts of the refraction index 
in classical electrodynamics. We explain why the Kramers-Kronig relations are not applicable to 
quantum corrections in a QFT and why instead a more general set of dispersion relations have to 
be used in this case. These are of course very well known notions; the main point we are stressing 
here is that to check the causal properties of a theory via analytic dispersion relations one needs to 
know the full off-shell structure of the propagator in momentum space. One can easily get confused 
by applying indirect criteria based on notions of group velocity or Kramers-Kronig relations for 
the on-shell refraction index. Indeed, in the literature there is a rather extensive discussion on 
"superluminal" effects in QED, e.g. O [TOl [TTl [8] . To avoid these subtleties, we will focus directly 
on the commutator in coordinate space. 

We then give two general arguments for microcausality in curved space-time, one based on the 
path-integral formulation of the theory, the other on canonical quantization (sect. [S]). As we will 
see, the crucial result is that the causal structure of the full quantum theory is the same as that 
of the corresponding classical field theory. In particular, two local field operators will commute 
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outside the light-cone whenever the classical theory cannot propagate information outside the 
light-cone. Notice that from this viewpoint the central role usually given to Lorentz invariance at 
the quantum level in proving microcausality in Minkowski space is somewhat misleading. Lorentz 
invariance may be crucial in ensuring the relativistic causal structure of the classical field theory, 
but from that point on microcausality is just a consequence of canonical quantization. 

These arguments are implicitly known in the literature. For example the formulation of the 
black hole information paradox in terms of nice slices is closely connected with our path integral 
argument (see e.g. [12] )• Still we felt it useful to give a dedicated discussion on the subject. 

The hypothesis that an interacting classical field theory does not propagate information outside 
the light cone is a highly non-trivial requirement, even in Minkowski space. A theorem due to 
Leray (see e.g. [1] and references therein) gives a general result for a system of PDE's that are 
linear in the fields' second derivatives and have the general form 



where G^'^ is a globally hyperbolic effective metric, smooth function of the space-time point x as 
well as of the fields and their first derivatives, V is any derivative operator, and the F's are smooth 
functions. The theorem states that in such a system the causal structure, i.e. the causal depen- 
dence of solutions on initial conditions, is determined by the light-cones of the effective metric G^^. 
Information propagates inside or along such light-cones. Notice that the effective metric in general 
depends on the field configuration, and so the causal structure will in general depend on the solu- 
tion. Now, for the classical relativistic Lagrangians that define usual renormalizable QFT's, the 
classical field equations are of the form ([9]) with G^y independent of the fields and their derivatives, 
and given simply by the (flat or curved) space-time metric g^^. In this case the causal structure 
is given by the light-cones of g^y, as expected. For non- renormalizable Lagrangians however, even 
if the field equations have the form ^ the effective metric G^^ is generically different from g^^, 
and superluminal propagation of signals may appear at the classical level about non-trivial field 
backgrounds, already in Minkowski space [5] . However a non-renormalizable Lagrangian does not 
lead to a well defined quantum theory, unless of course it admits a renormalizable UV completion, 
in which case such superluminal effects are absent. We can therefore safely assume that the field 
theories we will be interested in have classically a well-posed initial value formulation with no 
superluminal propagation of signals. 

As an example of microcausality in Lorentz-violating space-times, we present an explicit one- 
loop computation in a non-trivial background (sect. H]). To keep things as simple as possible, we 
consider a scalar field with cfr" self-interactions on a cylinder, i.e. on a D-dimensional Minkowski 
space with one spatial dimension compactified on a circle. Although such a space-time is metrically 
flat, the compactness of one spatial direction explicitly breaks Lorentz invariance, thus evading 
all traditional arguments reviewed above based on Lorentz invariance. Indeed, at tree level the 
two-point function is the sum over images of the free two-point functions in Minkowski space 
(see fig. [1]). Although not Lorentz invariant, it is at least causal in the sense that the retarded 
tree-level propagator vanishes outside the light cone — trivially, because the image contributions 
are not helping in going faster than light (going around the cylinder takes longer!). At one loop 




(9) 
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Figure 1: Left: The tree-level propagator on the cylinder is the flat-space tree-level propagator summed 
over all images. The red line is the first topologically non-trivial contribution. Right: At one-loop, apart 
from the sum over images of the one-loop propagators there are additional contributions coming from 
loops 'wrapping' around the circle. 

however, on top of the sum over images of the one-loop fiat-space propagators, there are new 
contributions coming from loops wrapping around the circle. We will show that this effect cannot 
lead to any imaginary part of the Feynman propagator outside the light-cone. 
We present our conclusions in sect. [5l 

2 Causality, Locality and Analyticity 

It is well known that causality and subluminality (as we will discuss shortly, in general these are 
different properties) manifest themselves as certain analytic properties of the Green's functions 
in momentum space. To see how this connection arises, let us consider a field that satisfies an 
equation of the following general form 



where G(t, x) is a solution of the field equation (fTOj) with a delta-function source at the origin 
6(t,x). A system described by (fTOj) is called causal if its Green's function can be chosen to satisfy 
the boundary condition 




(10) 



A generic solution to this field equation can be presented in the form 





for t <0 . 




5 



This property is just saying that the response of the system to a source cannot show up be- 
fore the source was turned on, and clearly should hold for all physical systems, both relativistic 
and non- relativistic. Note also that this discussion applies both to classical fields and to the 
vev's of quantum fields. In the latter discussed in sect. [H G(t,x) is the commutator 



Causality imposes strong restrictions on the possible forms of the Green's function G (and of 
the operator F). To see this let us perform the Fourier transform of the Green's function with 
respect to time, and define 



Causality (that is, eq. ( ITT]) ) implies that the integration interval in ( |T2l) is actually t G [0, oo). 
If one furthermore assumes that the system is stable, i. e. that the response to the source grows 
slower than an exponent with time, then G{uj,x) is analytic in the whole upper half complex u 
plane, Im u > 0. In principle, one can relax the last assumption and consider systems containing 
tachyonic degrees of freedom leading to exponential instabilities with instability rate F. In order 
to be causal the Green's function of such a system has to be analytic at Im > F. For simplicity, 
in what follows we assume that the system is stable. 

In order to see the implications of locality let us perform also the Fourier transform with respect 
to the spatial variables x, and consider 



A very mild notion of locality implies that this Fourier transform exists at all real p, and combined 
with causality it results in the analyticity of F{uj,p)~^ as a function of u at all real p and at 
Im u > 0. Of course, usually by locality one understands much stronger restrictions on the 
properties of the system, which in turns restricts more the analytic properties of the function 
F{u!,p)~^. For instance, it is reasonable to assume that at each moment t the response of the 
system to the delta-functional source is non-zero only in a finite region of space. In this case F~^ 
is an analytic function of both u and p at real p and Im u > 0. Subluminality is a stronger version 
of the last property which holds in all systems described by a relativistic quantum field theory. It 
says that G{t,x) is causal and vanishes outside the light-cone, i.e. when t < \x\. This translates 
into the analyticity of the function F{uj,p)~^ in the region Im u > |Im p\. 

An important thing to keep in mind is that to check what the causal properties of a given 
system are in general one needs to know the whole off- shell iunction F{uj, p) describing the coupling 
to a local source as in eq. ffTOj) . It is not enough to know just the dispersion law for free waves, 
i.e. the relation between u and real p such that F is zero. To illustrate this important point let 
us consider several specific examples. Let us start with a function F of the following form 



This kind of function appears, for instance, when one considers the propagation of light in a 
dispersive medium with n{uj) being the refraction index of the medium. Indeed precisely in this 



e{t)mt,x),mm- 




(12) 




(13) 



F{iu,p) = n [LU) ■ LU —p . 



(14) 
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context the relation between analyticity and causality was first exploited by Kramers and Kronig 
to derive their famous dispersion relations It is instructive to see how the above F{uj,p) 
appears in a field theory language. Namely, a non-relativistic medium can be thought of as a set 
of decoupled localized degrees of freedom (atoms/molecules of the medium) which can interact 
with the photon. For instance, approximating each degree of freedom as an harmonic oscillator 
of frequency M, one can write the following effective field theory action describing such a system 

C = \{d,Ar + i(X)2 - ^-M'X' - gAX , (15) 

where X is a field describing local degrees of freedom and A stands for the photon field (for 
simplicity, we are suppressing all vector indices). By integrating out the X field, one obtains the 
effective action describing the propagation of the photon field with a kinetic function of the form 
(fT4l), with 

"'<'^' = ' + ■ <^«) 

This simple model actually provides an accurate description of the atomic contribution to the 
dielectric constant. 

According to the above discussion, in order to describe a causal system the function should 
be analytic at all real p and Im u > 0. If F has the form f|T^ this condition implies that the 
refraction index n{uj) is also analytic. Indeed, if it has an essential singularity, then clearly F~^ 
has an essential singularity as well. If n{uj) has a branching point uq, then in order for ujq not to 
be a branching point of this should be a square root branching, i.e. in the vicinity of c^o, one 
has either n{u) ~ (c<j — cuo)"*^^^, or n{uj) ~ (a; — ujq)~^^'^. It is straightforward to check then, that in 
the first case, F~^ has a pole in the vicinity of at small real p, while in the second case it has 
a pole in the vicinity of Uq at large real p. Analogously, the existence of a pole singularity of n{u) 
would lead to a pole of F~^ at large real p. So the requirement of causality imposes very strong 
constraints on a kinetic function of the form (JT^ . In particular, in this case causality almost 
implies subluminality; the only extra conditions needed are that at short scales the dispersion 
relation takes the relativistic form, i.e. n —>■ 1 when uj — > oo, and a certain positivity assumption. 
Indeed, for the retarded propagator one has (for simplicity, we are considering (1 + l)-dimensional 
space-time) 

/ -i{ujt+px) 
dudp . (17) 

Let us consider the case x > 0; then by closing the contour of integration in the lower part of the 
p plane and performing the integral over p one obtains 

G(t,x) = (27rz) [ ^e-iM-~) ^ 
J 2nuj 

where 

a = sign(c(j Im n) . 
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If a is positive everywhere on the real axis, then it can be dropped from the integral (ITSl) . Then, 
at X > t the asymptotic condition n{uj oo) — 1 allows to close the contour of integration in the 
upper half plane of u and the analytic properties of n discussed above imply that G(t, x) is zero 
outside the light-cone. 

As an another example let us consider a kinetic function of the form 

Fiu;,p)=cu'-fip'). (19) 



Clearly, in many cases this function can give rise to the dispersion law for free waves as eq. f[T4j) : 
however, as we will see now, it has completely different causal properties. Indeed, this kinetic 
function corresponds to a second order differential equation in time, so that the existence of a 
causal Green's function is automatic, and causality does not impose any constraints on the analytic 
structure of fijP')- As long as /(p^) is real and positive on the real axis, the kinetic function (lT9il 
describes a stable causal system. However, as we are going to show now, unlike in the previous case 
the subluminality requirement is extremely powerful in this case — the only possible function / that 
leads to a retarded propagator that vanishes outside the light-cone is the quadratic polynomial, 

fij)) = ap"^ + bp + w? . 

To see this, it is convenient to use the subluminality condition discussed above, namely that 
F{uj^p)~^ is an analytic function of both its arguments in the region Im uj > |Im p\. This condition 
immediately implies that the function / should be analytic in the whole complex plane. It is 
slightly non-obvious that / cannot have a pole at some po, but it is straightforward to check, 
that in the vicinity of the pole one could always find a point such that Im /(p*)^/^ > |Im 
thus violating the subluminality condition. To complete the proof we will use only one extra 
assumption. Namely, we will make use of the fact that for reasonably local physical systems the 
Fourier transform of the retarded propagator with respect to the spatial coordinate, which is equal 
to 

sin (/(p)V^t) 

^(^'^)= fXpy/2 

is an exponentially bounded function of the spatial momentum p. This implies that / cannot 
grow faster than p"^ at infinity, and the only analytic function with this property is the quadratic 
polynomial. 

We see that the two kinetic functions fll4l) and fll9p describe systems with very different causal 
properties, in spite of the fact that they can lead to identical on-shell dispersion laws. As a very 
concrete example, let us consider a kinetic function of the form 

Fi = u;'- . (20) 

+ 1 

According to the above discussion this kinetic function corresponds to a perfectly stable and causal 
system, which does not however possess a light-cone. The on-shell dispersion law following from 



8 



this kinetic function is the same as that following from 




(21) 



This kinetic function has the form (fMl) . and it does not describe a causal system due to the 
presence of the branch cut in the upper half-plane of uj. This somewhat counterintuitive result is 
easy to understand. The point is that it does not matter which one of the functions Fi 2 to put in 
eq. ffTOj) as long as sources are absent, j = 0: the free solutions are the same. However, the relation 
between the two different sources that for the two different choices of the kinetic function create 
identical field configurations, is highly non-local. As a result the causal properties of the system, 
in its response to local sources are very different in the two cases. In other words, eq. ( fTOj) . apart 
from providing a free wave dispersion law, carries a physically very important extra information — 
it describes how the system is coupled to local sources. Without access to this information one 
cannot decide about the causal properties of the system. 

Of course, the kinetic functions arising in relativistic quantum field theories have neither the 
form fll4p . nor the form fll9p . According to the above discussion, in order to draw any conclusion 
about the causal structure of the theory from the direct study of the analytic properties of the 
kinetic functions, one needs to know the full off-shell result. For instance, a recent work [8] 
discussed the fate of (micro) causality in QED in curved backgrounds. At one loop the kinetic 
function in general has the form 



However, the authors of [8] brought it into the form (fT4|) by means of the tree on-shell condition, 
and it was found that the resulting n{uj) does not enjoy the conventional analytical properties, 
i.e. the Kramers-Kronig relations. Due to the reasons explained above, this is not very surprising 
and by itself does not indicate a breakdown of (micro) causality. Actually, the necessity to know 
the full off-shell structure makes it somewhat unpractical to study the analytic properties of the 
propagator in momentum space to check microcausality in curved backgrounds in perturbative 
calculations. Instead, in what follows we will study the issue directly in position space. 



In this section we will present two arguments that prove microcausality without relying on Lorentz 
invariance. The arguments are very similar in spirit, but use somewhat different formalisms. Both 
of them show that the actual content of microcausality is that the quantum theory inherits the 
causal structure present in the corresponding classical theory. 

3.1 Functional Integral Argument 

We will now show that the commutator of two operators Oi^2 localized at any two points belonging 
to the same space-like hypersurface E vanishes. The essence of the argument is that one can 




3 Microcausality in Curved Space 
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Figure 2: For any two space-like separated points at different times (ti,t2) one can choose a set of 
coordinates such that they lie on the same time slice. 

quantize the theory by choosing the time coordinate in such a way that the surface S corresponds 
to a constant time shce. With this choice of coordinates we are deahng with an equal-time 
commutator, C2(t)], which is zero. Indeed, in canonical quantization 

0{t,x) =fe'^~^^0{-oo,x)Te-'^-^^, (22) 

so that 

[0,{t, xi), 02{t, X2)] = f e*^-- ^ [Oi(-oo, X,), 02(-oo, 0:2)] Te"*^-- ^ = , 

where the last equality holds just by the definition of the canonical quantization. The only part 
of this argument that requires extra justification is the starting point — namely the statement that 
the result of the quantization does not depend on the choice of the time coordinate. To prove 
this desirable property it is convenient to use the functional integral representation of the matrix 
elements. Suppose that we quantized the system according to some time-slicing with time variable 
t, and let us consider the matrix element 

M = {HOi{ti,xi)02ih,X2M2) , (23) 

where \ipi,2) are any two states. 

By making use of (l22l) . which determines the time evolution of the Heisenberg operators, one 
can rewrite Ai in the following form (for definiteness, we assume here that ti > t2, and the 
extension to the case ti < ^2 is straightforward), 

M = (7/'i|fe*^--^Te"'^*^i^Oi(-oo,a:i)Te-*^4'^02(-oo,X2)Te~'^--^|7/'2) , (24) 
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where r is an arbitrary moment of time later than both ti and t2, t > ti^2- Now we can use the 
conventional representation of the (anti)time ordered products in the functional integral form and 
write (cf. e.g. pQ) 



Here the integral over (left) right fields 0(l)_r represents the (anti)time ordered exponents and the 
boundary conditions at t = — oo are determined by the states |'0i,2) in the matrix element flMl) . For 
ti < t2 one obtains a similar formula, the only difference being that operators Oi^2 will depend now 
on the "left" fields 0L(ti,2)- The functional integral formula f l25l) provides a representation of the 
matrix elements that is totally independent of the choice of time slices used to quantize the system. 
Consequently, to calculate the commutator in the operator formalism one can canonically quantize 
the system using whatever choice of the time variable one finds convenient, as long as it provides 
a consistent Hamiltonian description. In particular in field theories with two-derivative kinetic 
terms in curved backgrounds for any two operators belonging to a single space-like hypersurface 
one can choose a time variable using this hypersurface as one of the slices, as we did above to 
show that the commutator vanishes. Notice that this procedure fails if the two points are time-like 
separated — there can be no consistent Hamiltonian evolution connecting a space-like hypersurface 
with a time-like one. Indeed, in this case somewhere in between the two slices the g^^ component 
of the metric vanishes, so that the equations of motion reduce to non-dynamical constraints and 
the Hamiltonian system degenerates (see |5j for a related discussion). 



3.2 Canonical Quantization Argument 

Now we give another argument for microcausality in curved space-time entirely based on canonical 
quantization. We will show that in curved space-time the causal structure of a quantum field 
theory is the same as that of the corresponding classical theory. More precisely, if the classical 
theory one starts with cannot propagate information outside the light-cone, then upon canonical 
quantization of the theory commutators of field variables will vanish outside the light cone. 

Let's start with a generic mechanical system. Suppose we have a classical Hamiltonian system 
with canonical variables (gi, . . . ,qN',Pi, ■ ■ ■ ,Pn), collectively denoted by (g,p), and Hamiltonian 
H{q,p). At equal times the Hamiltonian variables obey canonical Poisson brackets, {qi{t), qj{t)} = 
0, etc. But what about Poisson brackets of variables at different times, for instance {gi(0), q2{t)}'? 
These depend on the dynamics. Indeed by solving the Hamilton equations 

qi = {qi, H} , pi = {pi, H) (26) 

one can express all canonical variables at time t as functions of the initial conditions (g(0),p(0)) 
given at some earlier time t = 0. So for example 

g2(t) = Q2(t;g(0),p(0)) . (27) 
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Then the Poisson bracket we are interested in is simply 



dQ2{t;q{0),p{0)) 



(28) 



In particular, if for any dynamical reason the solution for q2 at time t does not depend on the 
initial condition pi{0), then the Poisson bracket between qi{0) and q2{t) will vanish. 

We now quantize the system by promoting all Hamiltonian variables to operators — which 
we will denote by 'hatted' variables q, p — and imposing canonical commutation relations. In 
Heisenberg picture we can ask again what is the commutator of canonical variables at different 
times, e.g. [gi(0), g2(t)] • 

The Hamilton equations now read 



By definition the quantum Hamiltonian H{q,p) has the same functional dependence on the op- 
erators q, p as the classical Hamiltonian has on the classical canonical variables q, p. The only 
difference resides in possible ordering ambiguities, which are absent in the classical theory whereas 
in the quantum one they can lead to physically different systems. Also the algebra of canonical 
commutation relations in the quantum theory is the same as the algebra of (i x ) canonical Poisson 
brackets in the classical theory. As a result of these facts the quantum canonical variables q,p 
satisfy operator Hamilton equations that are functionally the same as the classical ones, again 
apart from ordering issues. This property is often referred to as 'the quantum variables satisfy the 
classical e.o.m. as operator equations'. As a consequence the operator solutions to the quantum 
Hamilton equations will be functionally the same as the classical solutions, in their t-dependence 
as well as in their dependence on the canonical variables at some earlier time t = 0. So for example 
the operator q2 at time t will be 



where Q2 is exactly the same function appearing in the classical theory, eq. fl271) . with some properly 
chosen ordering of the g's and p's. 

We now can use the classical solutions to compute commutators of variables at different times 
in the quantum theory. So for example 



where Q2 is the classical solution for q2{t), with some specific ordering. Now, the point is that if 
for any reason in the classical theory the solution for q2 at time t is not sensitive to the value of 
Pi at t = 0, then the above commutator will vanish. This is because pi(0) is the only canonical 
variable at t = with which qi{0) does not commute. But if the classical solution for q2{t) does 
not involve pi{0), the quantum solution for q2{t) will not involve the operator pi{0). In this case 




(29) 



g2(t) = Q2(t;g(0),p(0)), 



(30) 



[gi(0),g2(t)] = [gi(0),Q2(t;g(0),p(0))] , 



(31) 
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there will be no pi{0) in the right argument of the commutator (I3T1) . and the commutator itself will 
vanish. This is obviously the quantum analogue of eq. (125]) . Notice that in this respect ordering 
ambiguities are harmless: they cannot change whether the solution depends or not on some of the 
initial conditions. 

It is now clear where we are headed. All the discussion above applies unaltered to a field 
theory in curved space-time, which is nothing but an Hamiltonian system with infinitely many 
degrees of freedom — one at each point in space. For any given time-slicing of the space-time 
manifold, we call (pg the classical field variables and 11^ their conjugate momenta, with x being a 
set of spatial coordinates parameterizing the time slices. By solving the Hamilton equations one 
can in principle express the field at time t and position x as a functional of the initial field and 
momentum configurations given on some earlier time slice, for instance at t = 0, 

Mt) = ^4t-^mMo)) , (32) 

where by 0(0), n(0) we denote the field and momentum configurations on the whole t = slice. 
Then, as in eq. (128|1 . the Poisson bracket of field variables at different times is given by 

{MO)Ay{t)\ = sfTio) ' 

where S{- ■ ■)/S{- ■ ■) is a functional derivative. 

The system is quantized by imposing canonical commutation relations. We are then interested 
in the commutator of field operators at different space-time points, e.g. [0^(0), 0j7()f:)] . Then in 
this case eq. (I3T1) reads 

[MO),Mt)] = [0.^0), %(t; 0(0), fl(0))] , (34) 

where is the same functional appearing in the classical solution, eq. (l32ll . Now, 

(i) if the points (x, 0) and {y,t) are space-like separated, and 

(a) if the classical theory cannot propagate information outside the light cone, 

then the classical field at {y, t) cannot depend on the initial conditions given at the point (x, 0). It 
can only depend on initial conditions given inside the past light cone departing from [y, t) , see fig. El 
Hence, the functional ^y{t; 0(0), H(0)) does not depend on H^(0). This leads to the vanishing of 
the classical Poisson bracket eq. (133|) : but it also leads to the vanishing of the commutator eq. (IMI) 
in the quantum theory, 

[MO),Mt)] = 0, (35) 

since nx(0) is the only canonical variable at t = with which 0:^(0) does not commute. In this 
precise sense the causal structure of the quantum theory is the same as that of the classical one. 
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Figure 3: In a classical theory that does not admit superluminal propagation of signals, the field at 
{y,t) can only depend on initial conditions given inside the point's past light cone (shaded region). In 
particular, it cannot depend on the field and momentum values at {x,0). 

These two arguments can be straightforwardly extended to gauge theories as welL The key is 
to work in a gauge with a well-defined Hamiltonian evolution. For instance, in the first argument 
one can start by defining the matrix element fl23l) in the Aq = gauge, or in covariant form 



where is the time-like normal vector to the selected set of slices. In this gauge, one obtains 
an Hamiltonian system with canonical variables {Ai,Ff)i) and a Gauss's law constraint C, which 
generates the residual gauge transformations. Upon quantization this constraint is used to define 
the physical states \ip)^s by imposing the condition 



Now one can construct the functional integral representation of the matrix elements. The only 
difference with the case discussed above is that naively this functional integral depends on the 
choice of time slices through the vector n'^. However, integrals with different n'^'s differ only in 
the choice of the gauge fixing condition ( l36l) so that by making use of the conventional Faddeev- 
Popov trick one can show that the matrix elements between physical states do not depend on n^. 
Consequently, physical matrix elements do not depend on the choice of the time slices used to 
quantize the system, and one can run the argument as before. 

Similarly, one can use the same gauge to run the analogue of the second argument. In the 
classical theory all solutions with the same initial data differ just by a gauge transformation and, 
in particular, they are described by pure gauge configurations outside the light cone. Corre- 
spondingly, following the same logic as above, Poisson brackets (and commutators in the quantum 



AX = , 



(36) 




(37) 



14 



theory) are pure gauge outside the hght cone. Then the condition (1371) imphes that their matrix 
elements between physical states vanish. 

Finally, it is worth mentioning that it is straightforward to include fermions too, for instance 
the first argument applies to systems with fermions without any modifications. 

To conclude this section, it is worth stressing that these two arguments can be applied in 
more general contexts because they do not rely at all on the Lorentz invariance of the theory in 
flat background. They will go through for any non-relativistic theory allowing an Hamiltonian 
description with a well defined causal structure at the classical level. As we already stressed their 
real message is that the causal structure of the quantum theory is inherited from the classical the- 
ory without any modifications. For example in non-Lorentz invariant theories with two derivative 
Lagrangians the commutator will vanish outside the lightcone of the field with the largest velocity. 
On the other hand in theories with two time derivatives and higher order spatial derivative terms 
one can see that subluminality is lost both from the analiticity discussion in sect. HJand from the 
fact that with a non-trivial choice of time slices also higher time derivative terms are produced 
preventing the definition of an Hamiltonian evolution. 

4 Perturbative Example: Xcf)^ on the Cylinder 

The arguments we gave in the previous section, although general and very intuitive, hide the 
non-triviality of the result in general non Lorentz invariant backgrounds. In particular they do 
not rely on perturbation theory and Feynman graphs, the reason is simple — the cancellation of the 
commutator outside the light cone is everything but manifest when expressed in terms of Feynman 
diagrams. In order to show this point and to give a independent check of the arguments we present 
here an explicit one-loop calculation of the commutator in a non Lorentz invariant background. 
Consider theory on a cylinder in D space-time dimensions (namely on R^-i X S^). At tree 
level the 2-point function in coordinate space is just the sum over images of the Lorentz invariant 
free-propagator whose imaginary part vanishes outside the light cone, namely 



where /i is the mass, R the size of the circle, x and y the coordinates over M and respectively 
and Ky{s) is the modified Bessel function. Since the commutator vev is proportional to the 
imaginary part of the Feynman propagator (see eq. ([7])), the former trivially vanishes too. At 
higher loop orders, however, the 2-point function on the cylinder is not just the sum over images 
of the 1-loop 2-point functions on the plane, since it receives also contributions from loops wrapping 
the circle (see fig. [T]). We present here the calculation of the commutator at one-loop order to 
show how non-trivial the cancellation is. We do not need to calculate explicitly the 2-point 



oo 



GT{x,y) 



G'::%x,y + Rn), 



n=—oo 




(38) 
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function, it is enough to show that its imaginary part vanishes. Should we start the calculation 
directly in Minkowski signature we would encounter many complex contributions, and for the 
imaginary part to vanish one needs highly non-trivial cancelations between them. So it is highly 
advantageous first to perform the computation in Euchdean space (where the 2-point function 
is real by definition) and then show that one can analytically continue the result to Minkowski 
space-time. The one-loop correction to the two-point function in position space is 



Si-ioopGR{x,y)^X'' J d^'-^pY, 

Py 



where here and in the following we suppressed all numerical coefficients and n(p, py) is the one-loop 
integral, which after the introduction of the Feynman parameter reads 



fdcfd^-'kY 

Jo J u 



Applying the Poisson summation formula 

n=— oo °° m=— oo 

to the ky sum we get 

/■I r ^ pVmiky+py^R 

n(p,P,)=y^ d^Jd ^E[fc2+(y2 + ^2 + (p2+p2)^(l_^)p- 

Applying again the Poisson formula to the Py sum we arrive at the following expression for the 
propagator 



Si-ioop GR{x,y) = 



n.m ^ y 



gipx^ipyy ^im{ky+py^)R+inpyR 



+ + + + (p2 +p2)^(l _ 

After evaluating the integral over k by introducing the Schwinger parameter we get 

/•I /• ipx ipy{y+nR+(,mR) X2-D/2 ( A*^ + (P^ +P?)C(l " ) 

/ di d'^pY, 2 , 2 , 2N2 (1^1-^)'"'' \ ^ , J, J ■ 

Jo J _ {P^Py^l^} \\m\R^ij? + {p^+pl)^{l-^)'^ 



n,m 



Because the integral in p and Py is the Fourier transform of a function that depends only on 
p'^ + Py, the propagator will be a sum of terms that are functions of the combination x'^ + y'^, where 
y = y + nR + ^mR. Namely 



GR{x,y) = X' f diYFrn{x^ + f) 



(39) 
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Figure 4: Plot of the {t, y) plane. The shaded hyperbolic regions correspond to points inside the light 
cone in the {t,y) plane (eq. (JW)- '^he shaded stripe is the region where the Wick-rotated Fm integral 
converges (eq. ^4^)- The blue hyperbolas are constant t^ — tp" curves. Therefore for every point outside 
the light cone (e.g. A or B) there exists a point (e.g. A' or B' , respectively) with the same value oft'^ — y'^ 
(and thus of Fm) such that the integral converges. 



By definition the propagator in Euclidean space (139|) is real. At x° = the Euclidean and 
Minkowskian 2-point functions coincide, so the Minkowski propagator is also real. As long as 
the analytic continuation of the propagator to Minkowski space is free from divergences also the 
propagator in Minkowski space will be real and the commutator will vanish. 

Notice that outside the light cone the following inequality holds: 

-t^ + f = -t^ + (^y^nR + imRf > -m^R^^{l - • (40) 

Indeed, the quadratic term in ^ is the same on both sides, and it is then trivial to check that the 
inequality holds outside the hght-cone at the boundary values ^ = 0,1. Points inside the light 
cone thus correspond to the hyperbolic shaded regions in fig. HI For — + > no analytic 
continuation is needed — analogously to the Lorentz invariant case F^^^^(t, y) = Fm{—t^ +^^) and 
its imaginary part vanishes. For —rn^R^^il — ^) < —t^ + < the proof needs one more step. 
If we analytically continue x° it the integrand in Fm for large po becomes 



7 \2~/ I -\(5-£>)/2 ' 



17 



in particular the leading exponential contribution reads 



When the integral is convergent, i.e. 

< m2i?2^(l - , (41) 

F^^^^{t, y) = Fm{—t^ + y"^) is real and Lorentz invariant with respect to the vector (t, y). We can 
thus analytically continue F^^'^^ everywhere outside the hght cone (eq. fHOl) ) using this invariance. 

Therefore, for every point outside the light cone Fm (and thus G/j) can be analytically continued 
without incurring into singularities, so that the Minkowski propagator is real and the commutator 
vev vanishes. We see that microcausality is indeed a highly non-trival property at the level of 
the Feynman diagrams; even in this relatively simple example one needs to follow the structure 
of the loop integrals at a detailed level (such as the precise way in which the Feynman parameter 
^ enters into different expressions) to see the required cancellations. 



5 Discussion 

While Lorentz invariance is not necessary for microcausality to hold, it is usually exploited in 
the standard proofs because it makes the vanishing of commutators outside the light-cone man- 
ifest. We gave two general arguments that show why also in non-Lorentz invariant backgrounds 
microcausality is expected to hold. The first makes use of path integrals and their manifest in- 
dependence on the coordinate choice. In this way two space-hke separated points can be chosen 
to lie on the same time-slice. This fact together with the existence of a unitary evolution in any 
given time-slicing ensures microcausality. The second argument instead is basically the statement 
that quantization is the procedure that maps functions of classical variables satisfying canonical 
Poisson bracket relations into the same functions of operators satisfying canonical commutation 
relations. In this way points that are causally disconnected at the classical level (vanishing Poisson 
brackets) are automatically causally disconnected also at the quantum level (vanishing commuta- 
tors). Let us discuss some potential subtleties in the above two arguments. First, in quantum field 
theory it may happen that, due to the presence of the infinite number of degrees of freedom, the 
commutators of operators are modified as compared to the classical Poisson brackets, the most 
famous examples being the anomalous violation of the chiral symmetries and the central extension 
of the Virasoro algebra in string theory. In the functional integral argument this subtlety trans- 
lates into the necessity to define a properly regularized measure that does not modify the causal 
structure of the non-regularized theory. We do not expect this subtlety to be relevant in the case 
under consideration, as the quantum anomalies arise due to UV effects and exhibit themselves 
as localized terms in position space, while we are discussing commutators of space-like separated 

^We are only interested in the m ^ case, since for m — Q the 2-point function is the sum over images of the 
Lorentz invariant 1-loop contributions, which respect niicrocausahty. 
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operators. For instance, dimensional regularization in curved space-time [13] does not modify 
the causal structure of the theory, although care should be taken when applying it in Lorentzian 
signature (other popular covariant methods of regularization, such as point-splitting or adding 
higher derivative terms in general do modify the causal structure of the theory). 

Another subtlety (present only in the functional integral proof) is that it is not totally obvious 
that the impossibility to connect two points by a time-like or null curve (which is the most 
natural definition of space-like separation) automatically implies that there exists a smooth space- 
like Cauchy surface on which the two points both lie. It is easy to check that this is true for 
some simple space-times and would be interesting to understand under what conditions these two 
properties are equivalent. 

We also reported an explicit one-loop calculation of the commutator in a non-trivial back- 
ground, which shows that even in the simple case where Lorentz invariance is broken just by 
boundary conditions, checking microcausality in the perturbative expansion may be rather cum- 
bersome. The fact that the vanishing of the commutator is not trivial in non-Lorentz invariant 
backgrounds may in principle provide a powerful tool to derive identities involving matrix elements 
of operators and the spectrum in weakly as well as strongly coupled QFTs. Consider for example 
a QFT on the cylinder M^~^ x 5*^; the Kallen-Lehmann-like representation for the commutator 
vev of two local operators reads 



(0| [Ot(x, y), O{0, 0)]|0) = J2Y1 1 "^a(^), ^|O(0, 0)|0)|' e^-y/^^A^^^x; m,{n)) , (42) 



where the formal sum in A runs over all the states of the theory with KK momentum n/R (and 
includes the integral on the corresponding phase-space), mx{n) is the invariant (D — l)-dimensional 
mass and A^~^{x]mx{n)) is the [D — l)-dimensional commutator vev of a free field with mass 
7n\{n). In principle we can now use the fact that we know that the commutator vanishes outside 
the light cone to get a non trivial relation. For instance we can integrate eq. fj42|) with a generic 
function f{t,y) outside the light cone (i.e. on the diamond (} = {t,y : t"^ < {y + nR)'^,'^n G Z}) 
and get 



in order not to get contributions from delta-like singularities localized on the light-cone we have to 
restrict to functions / that vanish on the light cone. The above relation gives sum rules involving 
the matrix elements and the spectrum of the theory and that are non-trivial especially when the 
theory is strongly interacting. It would be interesting to study whether such relations could be 
used in similar contexts like extra-dimensional models or QCD on the lattice. For instance it 
was noticed in [M] that in string compactifications the mass of the lightest modulus is always 
parametrically smaller than the KK scale. One might expect that such inequality follows from 
microcausality in the compactified space. 



n 



X 



J]J]|(A;m,(n),n|O(0,0)|0)r 



A n 




dydt cos{2nny/R) sm{m\{n)t) f{t,y) = 0; (43) 
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